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SEQUENCE LISTING 

<110> University of Rochester 
Maquat, Lynne E. 



<120> NONSENSE -MEDIATED MRNA DECAY 



<130> 21108. 0023P1 

<150> 60/405,602 
<151> 2002-08-22 

<160> 38 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note 
synthetic construct 

<400> 1 

gcctattggt ctattttccc 

<210> 2 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note 
synthetic construct 

<400> 2 

cctgaagttc tcaggatc 

<210> 3 
<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note 
synthetic construct 

<400> 3 

atctggcacc acaccttcta caatgagctg eg 

<210> 4 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 4 

cgtcatactc ctgcttgctg atccacatct gc 

<210> 5 
<211> 19 
<212> DNA 

<213> Artificial Sequence 



<220> 



<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 5 

tgcaaggagt ttcatcctg 

<210> 6 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 6 

agaatcagta gtttaacaca c 

<210> 7 
<211> 22 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 



<400> 7 

tgagcatagt tattaatagc ag 

<210> 8 
<211> 77 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 8 

gctagctcga gaccggtgcc accatggact acaaagacga tgacgacaag gcggaagggc 60 
tggagcgtgt gcggatc 77 



<210> 9 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence : /note 
synthetic construct 

<400> 9 

tttaaacccg gcctgcgggg ccagagtagc caggatcccg cgc 

<210> 10 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note 
synthetic construct 

<400> 10 

tgaccttcag cgcctcgg 

<210> 11 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note 
synthetic construct 

<400> 11 

ctccgagtcc ctctgcc 

<210> 12 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note 
synthetic construct 

<400> 12 

ggcaaaggct ctgagaagc 

<210> 13 
<211> 17 
<212:> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note 
synthetic construct 

<400> 13 

ccgaggtccc aaaggcg 

<210> 14 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 14 

atcgaagatc tggatccaag gtcgggcagg agagggcct 

<210> 15 
<211> 59 
<212> -DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 15 

tacacaaagc aatgtccatt acatgccacg gtgtttcgtc ctttccacaa gatatataa 

<210> 16 
<211> 48 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 16 

cgaaatctag aaaaaagtgg catgtaatgg acattgccta cacaaagc 

<210> 17 

<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 17 

gcugcagcag aacaggccat t 

<210> 18 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 18 

guacaaccca ggauaugugt t 

<210> 19 
<211> 59 
<212> DNA 

<213> Artificial Sequence 



4 
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<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 19 

tacacaaaca gggctgttct tcgagatgcg gtgtttcgtc ctttccacaa gatatataa 59 

<210> 20 
<211> 48 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 20 

cgaaatctag aaaaaagcat ctcgaagaac agccctgcta cacaaaca 4 8 

<210> 21 
<211> 59 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 21 

tacacaaagc aatgtccgtt gcatgccacg gtgtttcgtc ctttccacaa gatatataa 59 

<210> 22 
<211> 48 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 22 

cgaaatctag aaaaaagtgg catgcaacgg acattgccta cacaaagc 48 

<210> 23 
<211> 59 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 23 

tacacaaagt tcagaggctg tgtcataacg gtgtttcgtc ctttccacaa gatatataa 59 

<210> 24 
<211> 48 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<400> 24 

cgaaatctag aaaaaagtta tgacacagcc tctgaaccta cacaaagt 48 

<210> 25 
<211> 59 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 25 

tacacaaacc aaggcacttg ttggcagtcg gtgtttcgtc ctttccacaa gatatataa 59 

<210> 26 
<211> 48 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 26 

cgaaatctag aaaaaagact gccaacaagt gccttggcta cacaaacc 48 

<210> 27 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 27 

gcagcgagca actgagaagc 20 

<210> 28 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 28 

gggtttagtg gtacttgtga gc 22 

<210> 29 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
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synthetic construct 

<400> 29 

gactgagccg atcccgcgc 

<210> 30 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note 
synthetic construct 

<400> 30 

gcagtaacgg cagacttctc 

<210> 31 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note 
'synthetic construct 

<400> 31 

cctttcctgc tcttgcctg 

<210> 32 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note 
synthetic construct 

<400> 32 

gctttttatt tgtcagaaga cag 

<210> 33 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note 
synthetic construct 

<400> 33 

atctggcacc acaccttcta caatgagctg 

<210> 34 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note 
synthetic construct 
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<400> 34 

cgtcatactc ctgcttgctg atccacatct 30 

<210> 35 

<211> 18 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 35 

atgacttcga aagtttat 18 

<210> 36 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 36 

ttcagatttg atcaacgca 19 

<210> 37 
<211> 1419 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 



<400> 37 



Met 


Ala 


Glu 


Gly 


Leu 


Glu 


Arg 


Val 


1 








5 








Gly 


He 


Leu 


Ala 


Thr 


Leu 


Ala 


Pro 






20 










Lys 


Glu 


Leu 


Lys 


Glu 


Ala 


Arg 


Pro 






35 










40 


Leu 


Glu 


He 


Tyr 


Lys 


Pro 


Gly 


Leu 




50 










55 




He 


Lys 


Glu 


Pro 


Pro 


Gly 


Ser 


Glu 


65 










70 






Asp 


Arg 


Asp 


Cys 


Ser 


Ala 


Val 


Glu 










85 








Val 


Cys 


Lys 


Glu 


Leu 


Asn 


Asn 


Gin 








100 










Glu 


Asn 


Asn 


Arg 


Gly 


Gin 


Glu 


Ser 






115 










120 


Asp 


Arg 


Ser 


Leu 


Lys 


He 


He 


Lys 




130 










135 




He 


Tyr 


Gin 


Pro 


Gly 


Arg 


Arg 


Leu 


145 










150 






Ser 


Arg 


Val 


Glu 


Glu 


Glu 


Glu 


Val 



165 



Arg 


He 


Ser 


Ala 


Ser 


Glu 


Leu 


Arg 




10 










15 




Gin 


Ala 


Gly 


Ser 


Arg 


Glu 


Asn 


Met 


25 










30 






Arg 


Lys 


Asp 


Asn 


Arg 


Arg 


Pro 


Asp 










45 








Ser 


Arg 


Leu 


Arg 


Asn 


Lys 


Pro 


Lys 








60 










Glu 


Phe 


Lys 


Asp 


Glu 


He 


Val 


Asn 






75 










80 


Asn 


Gly 


Thr 


Gin 


Pro 


Val 


Lys 


Asp 




90 










95 




Glu 


Gin 


Asn 


Gly 


Pro 


He 


Asp 


Pro 


105 










110 






Phe 


Pro 


Arg 


Thr 


Ala 


Gly 


Gin 


Glu 










125 








Arg 


Thr 


Lys 


Lys 


Pro 


Asp 


Leu 


Gin 








140 










Gin 


Thr 


Val 


Ser 


Lys 


Glu 


Ser 


Ala 






155 








160 


Leu 


Asn 


Gin 


Val 


Glu 


Gin 


Leu 


Arg 




170 










175 
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Val 


Glu 


Glu 


Asp 


Glu 


Cys 








180 






Asn 


Lys 


Pro 


Asp 


Arg 


Ala 






195 








Val 


Gly 


Ala 


Ala 


Lys 


Gly 




210 










Gly 


Val 


Arg 


Glu 


Thr 


His 


225 










230 


Ala 


Lys 


Arg 


Tyr 


Ser 


Arg 










245 




Arg 


Ser 


Thr 


Ser 


Ser 


Ala 








260 






Leu 


Thr 


Asp 


Asn 


Gly 


Cys 






275 








Arg 


Pro 


Pro 


Leu 


Lys 


Lys 




290 










Asp 


Glu 


Asp 


Arg 


He 


Asp 


305 










310 


Ser 


Glu 


Arg 


Lys 


Arg 


His 










325 




Gly 


Glu 


Gin 


Lys 


Thr 


Ser 








340 






Thr 


Phe 


Asp 


Ala 


Glu 


Ala 






355 








Ala 


Arg 


Asp 


Asp 


Met 


Asp 




370 










Gly 


Gly 


Lys 


Gly 


Ser 


Glu 


385 










390 


Leu 


Arg 


Gly 


Arg 


Gly 


Arg 










405 




Leu 


Ser 


Val 


Asn 


Ser 


Ala 








420 






Arg 


Leu 


Leu 


Phe 


Gly 


Ser 






435 








Gly 


Gly 


Thr 


Thr 


Arg 


Arg 




450 










Pro 


Ala 


Leu 


Lys 


Thr 


Gin 


465 










470 


Asp 


Glu 


Val 


Ser 


Pro 


Thr 










485 




Ser 


Tyr 


Tyr 


Lys 


Phe 


Gin 








500 






Thr 


Pro 


Gly 


Pro 


Ala 


Ser 






515 








Gin 


Tyr 


Pro 


Val 


Gly 


Pro 




530 










Pro 


Gly 


Tyr 


Pro 


Thr 


Pro 


545 










550 


Thr 


Ser 


Thr 


Met 


Ser 


Pro 










565 




Gin 


Gin 


Gin 


Glu 


Leu 


His 








580 






Leu 


Gin 


Leu 


Ser 


Asn 


Leu 






595 








Leu 


Glu 


Lys 


Met 


Ala 


Gin 




610 










Arg 


Cys 


He 


Leu 


Leu 


Asp 


625 










630 


Gin 


He 


Leu 


Trp 


Lys 


Asn 



645 



Arg 


Gly 


Asn 


Val 


Ala 


Lys 






185 








Glu 


He 


Glu 


Lys 


Ser 


Pro 




200 










Glu 


Lys 


Gly 


Lys 


Arg 


Met 


215 










220 


Asp 


Asp 


Pro 


Ala 


Arg 


Gly 










235 




Ser 


Asp 


Lys 


Arg 


Arg 


Asn 








250 






Gly 


Ser 


Asn 


Asn 


Ser 


Ala 






265 








Arg 


Arg 


Arg 


Arg 


Gin 


Asp 




280 










Gin 


Val 


Ser 


Val 


Ser 


Ser 


295 










300 


Glu 


Pro 


Asp 


Gly 


Leu 


Gly 










315 




Leu 


Glu 


Arg 


Asn 


Trp 


Ser 








330 






Ala 


Lys 


Glu 


Tyr 


Arg 


Gly 






345 








Met 


Asn 


Lys 


Glu 


Ser 


Pro 




360 










Arg 


Gly 


Lys 


Pro 


Asp 


Lys 


375 










380 


Lys 


Gin 


Glu 


Ser 


Lys 


Asn 










395 




Gly 


He 


Leu 


He 


Leu 


Pro 








410' 






Gly 


Ser 


Pro 


Glu 


Ser 


Ala 






425 








Gly 


Ser 


Lys 


Gly 


Ser 


Arg 




440 










Leu 


Trp 


Asp 


Pro 


Asn 


Asn 


455 










460 


Thr 


Pro 


Gin 


Leu 


His 


Phe 










475 




Ser 


Trp 


Gly 


Asp 


Ser 


Arg 








490 






Asn 


Ser 


Asp 


Asn 


Pro 


Tyr 






505 








Gin 


Tyr 


Pro 


Tyr 


Thr 


Gly 




520 










Thr 


Asn 


Gly 


Val 


Tyr 


Pro 


535 










540 


Ser 


Gly 


Gin 


Tyr 


Val 


Cys 










555 




Glu 


Glu 


Val 


Glu 


Gin 


His 








570 






Arg 


Leu 


Leu 


Arg 


Val 


Ala 






585 








Leu 


Ser 


Arg 


Asp 


Arg 


He 




600 










Leu 


Arg 


Ala 


Glu 


Leu 


Leu 


615 










620 


He 


Glu 


Phe 


Ser 


Asp 


Asn 










635 




Ala 


Phe 


Tyr 


Gin 


Val 


He 



650 
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Glu 


Glu 


Val 


Ala 




190 






Gly 


Gly 


Gly 


Arg 


205 








Gly 


Lys 


Gly 


Glu 


Arg 


Pro 


Gly 


Ser 








240 


Arg 


Tyr 


Arg 


Thr 






255 




Glu 


Gly 


Ala 


Gly 




270 






Arg 


Thr 


Lys 


Glu 


285 








Thr 


Asp 


Ser 


Leu 


Pro 


Arg 


Arg 


Ser 








320 


Gly 


Arg 


Gly 


Glu 






335 




Thr 


Leu 


Arg 


Val 




350 






Met 


Val 


Arg 


Ser 


365 








Gly 


Leu 


Ser 


Ser 


Pro 


Lys 


Gin 


Glu 








400 


Ala 


His 


Thr 


Thr 






415 




Pro 


Leu 


Gly 


Pro 




430 






Ser 


Trp 


Gly 


Arg 


445 








Pro 


Asp 


Gin 


Lys 


Leu 


Asp 


Thr 


Asp 








480 


Gin 


Ala 


Gin 


Ala 






495 




Tyr 


Tyr 


Pro 


Arg 




510 






Tyr 


Asn 


Pro 


Leu 


525 








Gly 


Pro 


Tyr 


Tyr 


Ser 


Pro 


Leu 


Pro 








560 


Met 


Arg 


Asn 


Leu 






575 




Asp 


Asn 


Gin 


Glu 




590 






Ser 


Pro 


Glu 


Gly 


605 








Gin 


Leu 


Tyr 


Glu 


Gin 


Asn 


Val 


Asp 








640 


Glu 


Lys 


Phe 


Arg 






655 
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Gin 


Leu 


Val 


Lys 


Asp 


Pro 


Asn 


Val 


Glu 


Asn 


Pro 


Glu 


Gin 


He 


Arg 


Asn 








660 










665 










670 






Arg 


Leu 


Leu 


Glu 


Leu 


Leu 


Asp 

* Jr 


Glu 


Gly 


Ser 


Asp 

Jr 


Phe 


Phe 


Asp 


Ser 


Leu 






675 










680 










685 








Leu 


Gin 


Lvs 


Leu 


Gin 


Val 


Thr 


Tyr 


Lys 


Phe 


Lys 


Leu 


Glu 


Asp 


Tyr 


Met 




690 










695 










700 










Asp 


Gly 


Leu 


Ala 


He 


Arcr 


Ser 


Lvs 


Pro 


Leu 


Arg 


Lvs 

jr 


Thr 


Val 


Lys 


Tyr 


705 










710 










715 










720 


Ala 


Leu 


He 


Ser 


Ala 


Gin 


Arg 


Cvs 

^ Jr *^ 


Met 


He 


Cvs 


Gin 


Glv 


Asp 


He 


Ala 










725 










730 










735 




Arcr 


Tvr 


Arg 


Glu 


Gin 


Ala 


Ser 


Asp 


Thr 


Ala 


Asn 


Tvr 

Jr 


Glv 


Lvs 


Ala 


Arg 








740 










745 










750 






Ser 


Trp 


Tvr 


Leu 


Lvs 


Ala 


Gin 


His 


He 


Ala 


Pro 


Lvs 

jr 


Asn 


Gly 


Arg 


Pro 






755 










760 










765 








Tyr 


Asn 


Gin 


Leu 


Ala 


Leu 


Leu 


Ala 


Val 


Tvr 

jr 


Thr 


Arg 


Arcr 


Lys 


Leu 


Asp 




770 










775 










780 










Ala 


Val 


Tvr 


Tvr 


Tvr 


Met 


Arcr 


Ser 


Leu 


Ala 


Ala 


Ser 


Asn 


Pro 


He 


Leu 


7 85 










790 










795 










800 


Thr 


Ala 


Lvs 


Glu 


Ser 


Leu 


Met 


Ser 


Leu 


Phe 


Glu 


Glu 


Thr 


Lvs 


Arcr 


Lvs 










805 










810 










815 




Ala 


Glu 


Gin 


Met 


Glu 


Lvs 


Lvs 


Gin 


His 


Glu 


Glu 


Phe 


Asp 


Leu 


Ser 


Pro 








820 










825 










830 






Asp 


Gin 


Tro 


Arg 


Lys 


Glv 


Lys 


Lys 


Ser 


Thr 


Phe 


Arcr 


His 


Val 


Glv 

UJ> Jr 


Asp 






835 










840 










845 








Asp 


Thr 


Thr 


Arcr 


Leu 


Glu 


He 


Trp 


He 


His 


Pro 


Ser 


His 


Pro 


Ara 


Ser 




850 










855 










860 










Ser 


Gin 


Gly 


Thr 


Glu 


Ser 


Gly 


Lvs 

jr 


Asp 


Ser 


Glu 


Gin 


Glu 


Asn 


Gly 


Leu 


865 










870 










875 










880 


Gly 


Ser 


Leu 


Ser 


Pro 


Ser 


Asp 


Leu 


,Asn 


Lvs 


Arg 


Phe 


He 


Leu 


Ser 


Phe 










885 










890 










895 




Leu 


His 


Ala 


His 


Glv 

Jr 


Lvs 


Leu 


Phe 


Thr 


Arg 


He 


Gly 

w j- jr 


Met 


Glu 


Thr 


Phe 








900 










905 










910 






Pro 


Ala 


Val 


Ala 


Glu 


Lys 


Val 


Leu 


Lys 


Glu 


Phe 


Gin 


Val 


Leu 


Leu 


Gin 






915 










920 










925 








His 


Ser 


Pro 


Ser 


Pro 


He 


Glv 

vaj - Jr 


Ser 


Thr 


Ara 


Met 


Leu 


Gin 


Leu 


Met 


Thr 




930 










935 










940 










lie 


Asn 


Met 


Phe 


Ala 


Val 


His 


Asn 


Ser 


Gin 


Leu 


Lvs 


Asp 


Cvs 


Phe 


Ser 


945 










950 










955 










960 


Glu 


Glu 


Cvs 


Arq 


Ser 


Val 


He 


Gin 


Glu 


Gin 


Ala 


Ala 


Ala 


Leu 


Gly 


Leu 










965 










970 










975 




Ala 


Met 


Phe 


Ser 


Leu 


Leu 


Val 


Arg 


Arg 


Cvs 


Thr 


Cys 


Leu 


Leu 


Lvs 


Glu 








980 










985 










990 






Ser 


Ala 


Lys 


Ala 


Gin 


Leu 


Ser 


Ser 


Pro 


Glu 


Asp 


Gin 


Asp 


Asp 


Gin 


Asp 






995 










1000 








1005 






Asp 


lie 


Lys 


Val 


Ser 


Ser 


Phe 


Val 


Pro 


Asp 


Leu 


Lys 


Glu 


Leu 


Leu 


Pro 




1010 








1015 








1020 








Ser 


Val 


Lys 


Val 


Trp 


Ser 


Asp 


Trp 


Met 


Leu 


Gly 


Tyr 


Pro 


Asp 


Thr 


Trp 


1025 








1030 








1035 








1041 


Asn 


Pro 


Pro 


Pro 


Thr 


Ser 


Leu 


Asp 


Leu 


Pro 


Ser 


His 


Val 


Ala 


Val 


Asp 










1045 








1050 








1055 


Val 


Trp 


Ser 


Thr 


Leu 


Ala 


Asp 


Phe 


Cys 


Asn 


He 


Leu 


Thr 


Ala 


Val 


Asn 








1060 








1065 








1070 




Gin 


Ser 


Glu 


Val 


Pro 


Leu 


Tyr 


Lys 


Asp 


Pro 


Asp 


Asp 


Asp 


Leu 


Thr 


Leu 






1075 








1080 








1085 






Leu 


He 


Leu 


Glu 


Glu 


Asp 


Arg 


Leu 


Leu 


Ser 


Gly 


Phe 


Val 


Pro 


Leu 


Leu 




1090 








1095 








1100 








Ala 


Ala 


Pro 


Gin 


Asp 


Pro 


Cys 


Tyr 


Val 


Glu 


Lys 


Thr 


Ser Asp 


Lys 


Val 


1105 








1110 








1115 








112i 


lie 


Ala 


Ala 


Asp 


Cys 


Lys 


Arg 


Val 


Thr 


Val 


Leu 


Lys 


Tyr 


Phe 


Leu 


Glu 



1125 1130 1135 
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Ala Leu Cys Gly Gin Glu Glu Pro Leu Leu Ala Phe Lys Gly Gly Lys 

1140 1145 ~ 1150 

Tyr Val Ser Val Ala Pro Val Pro Asp Thr Met Gly Lys Glu Met Gly 

1155 1160 1165 

Ser Gin Glu Gly Thr Arg Leu Glu Asp Glu Glu Glu Asp Val Val lie 

1170 1175 1180 

Glu Asp Phe Glu Glu Asp Ser Glu Ala Glu Gly Ser Gly Gly Glu Asp 
1185 1190 1195 1200 

Asp lie Arg Glu Leu Arg Ala Lys Lys Leu Ala Leu Ala Arg Lys lie 

1205 1210 1215 

Ala Glu Gin Gin Arg Arg Gin Glu Lys lie Gin Ala Val Leu Glu Asp 

1220 1225 1230 

His Ser Gin Met Arg Gin Met Glu Leu Glu lie Arg Pro Leu Phe Leu 

1235 1240 1245 

Val Pro Asp Thr Asn Gly Phe lie Asp His Leu Ala Ser Leu Ala Arg 

1250 1255 1260 

Leu Leu Glu Ser Arg Lys Tyr lie Leu Val Val Pro Leu lie Val lie 
1265 1270 1275 1280 

Asn Glu Leu Asp Gly Leu Ala Lys Gly Gin Glu Thr Asp His Arg Ala 

1285 1290 1295 

Gly Gly Tyr Ala Arg Val Val Gin Glu Lys Ala Arg Lys Ser lie Glu 

1300 1305 ~ ' 1310 

Phe Leu Glu Gin Arg Phe Glu Ser Arg Asp Ser Cys Leu Arg Ala Leu 

1315 1320 1325 

Thr Ser Arg Gly Asn Glu Leu Glu Ser lie Ala Phe Arg Ser Glu Asp 

1330 1335 1340 

lie Thr Gly Gin Leu Gly Asn Asn Asp Asp Leu lie Leu Ser Cys Cys 
1345 1350 1355 1360 

Leu His Tyr Cys Lys Asp Lys Ala Lys Asp Phe Met Pro Ala Ser Lys 

1365 1370 1375 

Glu Glu Pro lie Arg Leu Leu Arg Glu Val Val Leu Leu Thr Asp Asp 

1380 1385 1390 

Arg Asn Leu Arg Val Lys Ala Leu Thr Arg Asn Val Pro Val Arg Asp 

1395 1400 1405 

lie Pro Ala Phe Leu Thr Trp Ala Gin Val Gly 
1410 1415 



<210> 38 
<211> 5965 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 38 

cctggctgcg cgcggcggtg gcggagccgc tacggctgta gcagcagccg cgaagatggc 60 

ggaagggctg gagcgtgtgc ggatctccgc gtcggagctg cgcgggatcc tggctactct 120 

ggccccgcag gccgggagca gagaaaacat gaaggaatta aaggaggcca ggccgcgcaa 180 

agataacagg cgtccagatc tggaaatcta taagcctggc ctttctcggc taaggaacaa 24 0 

gcccaaaatc aaggaacccc ctgggagtga ggaattcaaa gatgaaattg ttaatgaccg 300 

agattgctct gctgttgaaa atggtacaca gcccgttaaa gatgtctgca aggaactgaa 360 

caaccaagag cagaatggtc ctatagaccc agaaaataat cggggacaag aatcctttcc 420 

taggactgct ggacaagagg atcgtagtct aaaaattatc aaaagaacaa agaaacccga 4 80 

cctgcagatc tatcagcctg gacgacgttt gcagactgtt agcaaagaat ccgccagtcg 54 0 

ggtggaggag gaagaagtcc tcaaccaggt agaacaactg agagtagagg aagatgagtg 600 

taggggaaat gttgcgaagg aggaagttgc gaataaacca gacagggccg agatagaaaa 660 

gagcccaggt ggtgggagag taggggctgc aaaaggagaa aaaggaaaga ggatgggaaa 720 

aggggagggg gtgagggaaa cccacgacga cccggcccgc gggaggccgg gctccgcaaa 780 

gcgctactcc cgctcagaca aacgaaggaa tcgctaccgc acgcgcagca ccagctcagc 840 
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tggcagcaac aacagcgctg agggagctgg cctgacggat aatggatgtc gccgccgccg 900 
acaggatagg accaaggaga ggccaccact gaagaagcaa gtgtctgtgt cctcaaccga 960 

ttccttagac gaggacagaa ttgatgagcc tgatggatta ggacccagga gaagttcaga 1020 

aaggaagaga catttagaaa gaaactggtc tggccgtggg gagggtgagc agaaaaccag 1080 

tgctaaagaa tatcgaggca ctcttcgtgt cactttcgat gcagaagcca tgaacaaaga 1140 

gtctcccatg gtgaggtcag ccagggatga tatggataga ggaaagcctg acaaaggctt 1200 

gagcagtggg ggcaaaggct ctgagaagca ggagtccaaa aacccgaaac aagaacttcg 1260 

gggtcgtggt cgtggcattc tgattttgcc tgcccatacc accctatctg tcaattcagc 1320 

aggttctcca gagtccgcgc ctttgggacc tcggcttttg tttggatctg gtagtaaggg 1380 

atctcggagt tggggccgtg gaggcaccac acgccgattg tgggacccaa acaatcctga 1440 

tcagaaacct gctctaaaga ctcagacgcc ccagctacat ttcttggaca ctgatgatga 1500 

agtcagccct acatcttggg gtgactcacg ccaggctcag gcatcttact ataagtttca 1560 

aaactctgac aacccctatt attacccccg gacaccaggc cctgcctccc agtatcccta 1620 

tacgggctat aaccctctac agtacccagt gggccctacg aatggtgtgt acccagggcc 1680 

ttactaccca ggctacccga ctccgtcagg acagtatgtg tgtagccctc tacctaccag 1740 

caccatgagt cccgaggagg tagagcagca catgaggaac ctgcagcaac aggagctgca 1800 

caggcttctc cgggtggctg acaaccagga actgcagctc agcaacctgc tctccaggga 1860 

ccgcatcagt ccggagggcc tggagaagat ggcgcaactc agagctgaac tgctgcagct 1920 

atatgagcgc tgtattctat tagatattga gttctctgat aatcagaatg tggatcagat 1980 

cctgtggaag aatgctttct atcaggtgat tgagaagttc aggcaacttg tcaaggatcc 2040 

gaatgttgag aacccagaac agattcggaa cagacttttg gagctcttgg atgagggtag 2100 

tgacttcttt gatagtttgc ttcagaagct gcaggttact tacaagttca aactggaaga 2160 

ctacatggat ggtcttgcca ttcgcagcaa gccattacgc aagacagtaa aatatgcctt 2220 

gatcagtgcc cagcgatgca tgatatgcca aggagatatt gctaggtacc gggagcaagc 2280 

cagtgataca gcgaattatg ggaaagcacg cagttggtac ctgaaggccc agcacattgc 2340 

tcccaagaat gggcgcccct ataaccagtt ggctttgctg gcagtgtata cgaggaggaa 24 00 

gcttgacgct gtctattact atatgcgcag tttagctgcc agcaacccta tcctgactgc 24 60 

caaggagagt ctcatgagct tgtttgaaga gaccaagcgg aaggcagaac agatggaaaa 2520 

gaagcaacat gaggaatttg acctgagccc tgaccagtgg cggaaaggaa agaagtctac 2580 

tttccggcat gttggagatg acaccactcg cctggagatc tggattcatc catcccatcc 2640 

acggtcttcc cagggcactg agtctgggaa ggattctgag caagagaatg ggctgggcag 27 00 

cctgagtccc agtgatctga acaaaaggtt catcctcagt tttctccatg cccatgggaa 27 60 

gctgtttacc cggattggga tggagacatt ccctgcagtg gctgagaagg tcctcaagga 2820 

gttccaggtg ttactgcagc acagcccctc tcccattgga agtacccgca tgctgcagct 2880 

tatgaccatc aatatgtttg cagtacacaa ctcccagctg aaagactgct tctcggagga 2940 

gtgccgctct gtgatccagg aacaagccgc agctctgggc ttggccatgt tttctctact 3000 

ggtccgccgc tgcacctgct tacttaagga gtccgccaaa gctcagctgt cctctcctga 3060 

ggaccaggat gaccaagacg acatcaaggt gtcttccttt gtcccggacc tgaaggagct 3120 

gctccccagt gtcaaagtct ggtcagattg gatgctcggc tacccggaca cctggaatcc 3180 

tcctcccaca tccctggatc tgccctcgca tgttgctgtg gatgtatggt cgacgctggc 3240 

tgatttctgt aacatactga ctgcagtgaa tcagtctgag gtgccactgt acaaggaccc 3300 

ggatgatgac ctcacccttc ttatcctgga agaggatcgg cttctctcgg gctttgtccc 3360 

cttgctggct gcccctcagg acccctgcta cgtggagaaa acctcggata aggttattgc 3420 

agctgactgc aaaagggtca cagtgctgaa gtattttctg gaagcccttt gtggacaaga 3480 

agagcctctg ctggcattca agggtggaaa gtatgtgtca gtggcacccg tcccagacac 3540 

catgggaaag gaaatgggaa gccaagaggg aacacgactg gaagatgagg aggaggatgt 3600 

ggtgattgaa gactttgagg aagattcaga ggctgaaggc agcggaggcg aggatgacat 3660 

cagggagctt cgggccaaga agctggctct ggccaggaag atagctgagc agcagcgtcg 3720 

ccaggaaaag atccaggctg tcctggagga ccacagtcag atgaggcaga tggagctcga 37 80 

aatcagacct ttgttcctcg taccagacac caacggcttc attgaccacc tggccagtct 3840 

ggcgcggctg ctggagagca ggaagtacat cctggtggtg cccctcatcg tgatcaatga 3900 

gctggacggc ctggccaagg ggcaggagac agaccaccgg gctgggggct acgcccgtgt 3960 

ggtacaagag aaggcccgca agtccatcga gttcctcgag cagcgattcg agagtcggga 4020 

ctcttgcctg cgagccctga ccagccgtgg caatgaactc gaatccatcg ccttccgcag 4080 

tgaggacatc actggccagc tgggtaacaa cgatgatctc atcctgtcct gctgcctcca 4140 

ctactgcaaa gacaaggcta aggacttcat gcccgccagc aaagaggagc caatccggct 4200 

actgcgggag gtggtgctgt tgacggatga ccggaacctg cgtgtgaagg cgctcacaag 4260 

gaatgttcct gtacgggaca tcccagcctt cctcacgtgg gcccaggtgg gctgagggag 4320 

ccacactggg gccccccccc cccgtggaac cgttcctgaa aggccaccag gcgcccagtg 4380 

tagcacggaa gatgcccacg tgcctgagcc accaatccac ccagacaata aaccatcctc 4440 

ttccaaccca cgccacggcc atgctgtggg ggacctgctc ctcacagagc ccctcccaag 4500 
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gatcgggcgg aagctgctgg gaccctcctg ggctgccagg atttagcagg 'gaggtggctg 4560 

gctacagcaa cagcagctgg gcaagccaga taggccgccc atgctctcag cctttctccc 4 620 

tcccccgtct cattccaagg ctgagggagg gccttctcgc ctggggacgc agccactttc 4 680 

tccagtggag acagggcagg ggttcagagt ttccgtcaga tgcagtgaaa tcacagttcc 4740 

ctttcatctt cagaacctct gtcgtgaatg tgttcaagag gctttggtta agtcaggaag 4800 

aagtgcccag ggtgtgtgtc cccagtctcc ctgaggcctg gactcgccca tgaacccaag 4860 

tcggcttcta gacagcatgt ccctaacagc agccctgggc ccccacctct tctaccatcc 4 920 

accccagact taccacacac ccttcctgct gctcctcttc ctgcccttat caacctgggt 4 980 

ccctcacact tcgccagttg cgtccccgtg gacagtcatg agtctagagg aaaggggcat 5040 

ctggtctcag gcccgtgctc tcgggtggcc tccacctgct ccctttctcc tcactggcct 5100 

ttctttccgt ctagcctcct cttcaggaaa tgtcctgact ctcctcagct cccccttcac 5160 

ccctccttgc ccgcctaccc tccctccaga atagcccctc acccttcttc cccttctagt 5220 

tgatcctttt cacctccctg atccccttca tttcttcacc gcggttcctc gtcatagggg 5280 

ttctcactct gaactttccc tctctactac ccatggcagg aacctagtac aggtctccca 5340 

cccagggcct tccaccctcg gctcctgtgc tgggagaaac ttccaggcgt ggacagccca 5400 

gcctgaggca ttccagtgct ggggcaccgt cgcctaacct ggtttctagc tttgccctca 5460 

ctccccggaa aaactgacac tgacacaggg gccctttcct tgccccttta gctggtacct 5520 

cagtggggag gcttccttac caagaatgag ttcctgaaac ccagggccag agacaaggac 5580 

aacttagggg aagacggggt tttcggtgga gccaggggca aatcttaatg ggaccagtgg 5640 

gggatacccc agagcccatg gcctgactgc acagcctgcc tggaggatgg gtgcgcagct 5700 

ctgccctccc tgaggcccag gactatgcca gaagcgatgg ggtaccgtgt aggggagcca 5760 

aggccagtag tttgggggta ggagtcccct agagtctcag aagactgggc tctttggagt 5820 

acagggtccc cggcctctcc tttaagattc tctccccagc tggaaggccg atgactgggt 5880 

ggtcgggagg gagacccagc tctcctttct gtcccgtttg cagcactggt tttgtttcct 5940 

taataaattt ttagttatga aacat 5965 
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